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55© F antigens of the human immuno-deficlency virus, and their applications. 

® The invention concerns purified expression products of DNA sequences derived of the genome of the LAV 
2j virus. It relates more particularly to a glycoprotein having a molecular weight of about 110,000 or antigen of 
lower molecular weight derived from the preceding one, which purified product possesses the capacity of being 
recognised by serums of human origin and containing antibodies against the LAV virus. It also relates to other 
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purified polypeptides derived either from the abovesaid glycoprotein or of core proteins of LAV virus. The 
glycoprotein can be used for the production of immunogenic composition capable of neutralizing the LAV virus. 
All of the abovesaid glycoprotein or polypeptides can be used as antigens useful in the diagnosis of LAV 
antibodies in sera of patients. 
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ANTIGENS, PARTICULARLY ENVELOPE ANTIGENS OF THE VIRUS OF LYMPHADENOPATHIES AND OF 
THE ACQUIRED IMMUNO-DEPRESSIVE SYNDROME, AND VIRUS, PROCESS FOR PRODUCING VIRUS 
ENVELOPE ANTIGENS, USE OF THIS ANTIGENS IN THE PREPARATION OF IMMUNOGENIC COMPOSI- 
TIONS OR FOR THE DIAGNOSIS OF THE PRESENCE OF ANTIBODIES AGAINST THIS VIRUS 



The present invention relates to antigens, particularly in a purified form, of the virus of lymphadenopath- 
ies (denoted below by the abbreviation LAS) and of the acquired immuno-depressive syndrome (denoted 
below by the abbreviation AIDS), to a process for producing these antigens, particularly antigens of the 
envelopes of these viruses. The invention also relates to polypeptides, whether glycosylated or not. 
5 encoded by said DNA sequences. 

The causative agent if LAS or AIDS, a retrovirus, has been identified by F. BARRE-SINOUSSI et al, 
Science, 220, 868 (1983). It has the following characteristics. It is T-lymphotropic; its prefered target is 
constituted by Leu 3 cells (or T4 lymphocytes) ; It has reverse transcriptase activity necessitating the 
presence of Mg + and exhibits strong affinity for poly(adenylate-oligodeoxy-thymidylate) (poly(A)-oligo(dT)- 
70 12-18). It has a density of 1.16-1.17 in a sucrose gradient, an average diameter of 139 nanometers; and a 
nucleus having an average diameter of 41 nanometers. Antigens of said virus, particularly a protein p25 are 
recognised immunologically by antibodies contained in serums taken up from patients afflicted with LAS or 
AIDS. The p25 protein, which is a core protein, is not recognised immunologically by the p24 protein of the 
HTLVI and II viruses. The virus is also free of a p19 protein which is immunologically cross-reactive with the 
75 p19 proteins of HTLVI and HTLVII. 

Retroviruses of this type (sometimes denoted by the generic abbreviation LAV) have been filed in the 
National Collection of Micro-organism Cultures of the INSTITUT PASTEUR of Paris, under numbers 1-232, I- 
240 and 1-241. Virus strains similar to LAV in all respects from the morphological and immunological point 
of view have been isolated in other laboratories. Reference is made by way of examples to the retrovirus 
20 strains named HTLV-III isolated by R.C. GALLO et al.. Science. 224, 500 (1984) and by M.G. SARNGAD- 
HARAN et al., Science 224, 506 (1984) respectively and to the retrovirus isolated by M. JAY LEVY et al., 
Science, 225, 840-842 (1984), which virus was designated ARV. For the ease of language the last 
mentioned viruses, as well as others which have equivalent morphological and immunological properties, 
will be designated hereafter under the generic designation H LAV W . Reference is also made to European 
25 patent application filed 14 September 1984, with the priority of British patent application number 83 24800 
filed 15 September 1983 as regards a more detailed description of the LAV retroviruses or the like and of 
the uses to which extracts of these viruses give rise. 

Initially the core antigens were the main antigens of the virus lysates or extracts which were recognised 
by serums of patients infected with AIDS or LAS. in the test systems which had then been used. A p42 
30 protein, presented as consisting of an envelope protein, had been detected too. In the same manner GALLO 
et al disclosed a p41 protein which was also deemed to be on a possible component of the viru envelope. 

Processes for obtaining a LAV virus have also been described. Reference may be made particularly to 
the article already mentioned of F. BARRE-SINOUSSI et al.. as regards the preparation of the virus in T 
lymphocyte cultures derived either from blood, or from the umbilical cord, or also from bone marrow cells 
35 of adult donors in good hearth. This process comprises particularly the following essential steps: 

- producing a viral infection of these T lymphocytes, after activation by a lectin mitogen, with a viral 
suspension derived from a crude supernatant liquor of lymphocytes producing the virus (initially obtained 
from a patient infected with AIDS or LAS), 

- culturing cells infected with TCGF, in the presence of anti-a-interferon sheep serum, 

40 - effecting purification of the virus produced (production starts generally between the 9th and the 15th day 
following infection and lasts from 10 to 15 days), which purification comprises precipitating the virus in 
polyethylenglycol in order to produce a first concentration of the virus, then centrifugating the preparation 
obtained in a 20-60 % sucrose gradient or in an isotonic gradient of metrizanide (sold under the trade mark 
NYCODENZ by NYEGAARD, Oslo) and recovering the virus with the band having a density of 1.16-1.17 in 

45 the sucrose gradient or of 1.10-1.11 in the NYCODENZ gradient 

The LAV virus may also be produced from permanent cell lines of type T, such as the CEM line, or 
from B lymphoblastoid cell lines, such as obtained by the transformation of the lymphocytes derived from a 
healthy donor with the Epstein-Barr virus, for instance as disclosed in French patent application Nr. 84 
07151 filed May 9. 1984. The permanent cell lines obtained produce continuously a virus (designated as 

so LAV-B in the case of the B lymphoblastoid cell lines) which possesses the essential antigenic and 
morphological features of the LAV viruses (except that it is collected in a density band sometimes slightly 
higher than in the preceding case (particularly 1.18) in sucrose. The final purification of the virus can also 
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be carried out in a NYCODENZ gradient 

Reference is made to the article of SchOpbach et al, Science, vol. 224, pages 503-505, 4 May 1984. 
This article discloses a work with respect to different antigens of the HTLV-III retrovirus. 

Results are presented including a reference to an antigen having migrated on an electrophoresis gel at 
s a migration distance corresponding to a molecular weight of approximately 110,000. 

Schupbach et al mentioned that this antigen was detected in a virus preparation but was below limit of 
detection in the cells. 

A method for cloning DNA sequences hybridizable with the genomic RNA of LAS has already been 
disclosed in British Patent Application Nr. 84 23659 filed on September 19, 1984. Reference is hereafter 

70 made to that application as concerns subject matter in common with the further improvements to the 
invention disclosed herein. 

The invention aims at providing purified unaltered virus forms (or viruses less altered by the purification 
procedures resorted to) and processes for obtaining said unaltered purified viruses. 

The present invention further aims at providing additional new means which should not only also be 

75 useful for the detection of LAV or related viruses (hereafter more generally referred to as "LAV viruses"), 
but also have more versatility, particularly in detecting specific parts of the genomic DNA of said viruses 
whose expression products are not always directly detectable by immunological methods. The present 
invention further aims at providing polypeptides containing sequences in common with polypeptides 
comprising antigenic determinants included in the proteins encoded and expressed by the LAV genome 

20 occuring in nature. An additional object of the invention is to further provide means for the detection of 
proteins related to LAV virus, particularly for the diagnosis of AIDS or pre-AIDS or, to the contrary, for the . 
detection of antibodies against, the LAV virus or proteins related therewith, particularly in patients afflicted 
with AIDS or pre-AIDS or more generally in asymtomatic carriers and in blood-related products. Finally the 
invention also aims at providing immunogenic polypeptides, and more particularly protective polypeptides 

25 for use in the preparation of vaccine compositions against AIDS or related syndroms. 

The present invention relates to additional DNA fragments, hybridizable with the genomic RNA of LAV 
as they will be disclosed hereafter, as well as with additional cDNA variants corresponding to the whole 
genomes of LAV viruses. It further relates to DNA recombinants containing said DNAs or cDNA fragments. 
An unaltered purified LAV retrovirus distinguishes from those which have been defined above, in that it 

30 includes an amount of one or several envelope antigens, sufficient to be visualized when the virus is 
labelled with 35 S-cystein, free of unlabelled cystein in a proportion of 200 microcuries per ml of medium, 
these antigens, among which particularly glycoproteins, are recognised selectively in vitro by serums of 
patients affected with SIDA or SLAs or by the serums of asymptomatic carriers of the virus. 

A preferred antigen according to the preceding definition obtainable from a lysate of this virus (or by 

35 gentle scouring of the envelopes of the virus) is a glycoprotein having a molecular weight of the order of 
110,000 daltons, as determined by its migration distance in comparison with the distances of migrations, in 
a same migration system, of standard proteins having known molecular weights. Particularly comparative 
measurements were made on a 12.5 % polyacryiamid gel under a voltage of 18 V for 18 hours, upon using 
the following standard proteins (marketed by AMERSHAM): 

40 - lysozyme-( 14 C)-methyl (MW: 14,300). 

- carbon dioxide-( 14 C)-methyl (MW: 30,000), 

- ovaJbumin-( 14 C)-methyl (MW: 46,000), 

- bovin albumin serum ( 14 C)-methyl (MW: 69,000), 

- phosphorylase b-( 14 C)-methyl (MW: 92,500), 
45 - myosine-( 14 C)-methyl (MW: 200,000). 

The intention relates also to the antigens themselves, particularly that of molecular weight of about 
110,000-120.000, which possess also the capability of being recognised by serums of patients infected with 
AIDS or LAS or by serums of persons who have been exposed to LAV viruses or those analogous with the 
latter. These antigens have also the characteristic of forming complexes with concanavaline A, said complex 

so being dissociatable in the presence of O-methyl-a-D-mannopyranoside. The antigens according to the 
invention can also bind to other lectins for example those known under the name "LENTYL-LECTIN". The 
preferred antigen according to the invention, of molecular weight 110,000, is also sensitive to the action of 
endoglycosidases. This action is manifested by the production from the antigen of molecular weight 
110,000 of a protein having a molecular weight of the order of 90.000, the latter being separable for 

55 example by immunoprecipitation or by separation employing the differences in molecular weights 
(migrations differentiated on gel). 

Preferred antigens of the invention are constituted by glycoproteins. 

The invention relates also to the process for producing the viruses according to the invention. This 
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process distinguishes essentially from those recalled above at the level of the final purification operation. In 
particular, the purification step of the process according to the invention Is no longer carried out in 
gradients, but involves the performance of differential centrifugations effected directly on the supernatants 
of the culture media of the producing cells. These centrifugation operations comprise particularly a first 

5 centrifugation at an angular centrifugation velocity, particularly of 10.000 rpm, enabling the removal of 
nonviral constituents, more particularly of cellular constituents, then a second centrifugation at higher 
angular velocity, particularly at 45,000 rpm, to obtain the precipitation of the virus itself. In preferred 
embodiments, the first centrifugation at 10,000 rpm, is maintained for 10 minutes and the second at 45,000 
rpm, for 20 minutes. These are, of course, only indicative values, it being understood that it remains within 

70 the ability of the specialist to modify the centrifugation conditions, to provide for the separation of the 
cellular constituents and of the viral constituents. 

This modification of the purification process results in the production of viral preparations from which 
the antigen mentioned can then be isolated more easely. than from virus preparations purified by the 
previous methods. In any event, the viruses finally obtained by the process of the present invention are 

75 more easely recognised by serums of patients or of persons who have been exposed to the LAV virus or to 
morphologically and antigenically similar strains. 

The antigens according to the invention can themselve be obtained from the above disclosed viruses, 
by lysis (or other suitable processing) of the latter in the presence of any suitable detergent and by 
recovery and separation of the antigens released. Advantageously, the lysis of the virus is effected in the 

20 presence of aprotinin or of any other agent suitable for inhibiting the action of. proteases. The separation of 
the antigens according to the invention can then be carried out by any method known in itself ; for example, 
it is possible to proceed with a separation of the proteins by employing their respectively different 
migrations in a predetermined gel, the protein sought being then isolated from the zone of the gel in which 
it would normally be found in an electrophoresis operation under well determined conditions, having regard 

25 to its molecular weight. The antigens according to the invention can however be separated from the lysate 
of the abovesaid viruses, due to their affinity for lectins, in particular concanavaline A or lentyHectin. The 
lectin used is preferably immobilised on a solid support, such as the cross linked polymer derived from 
agarose and marketed under the trade mark SEPHAROSE. After washing of the fixed antigens with a 
suitable buffer, the antigens can be eluted in any suitable manner, particularly by resorting to a O-methyl-o- 

30 D-mannopyranoside in solution. 

A more thorough purification of these antigens can be performed by immunoprecipitation with the 
serums of patients known to possess antibodies effective against said protein, with concentrated antibody 
preparations (polyclonal antibodies) or again with monoclonal antibodies, more particularly directed against 
the antigen according to the invention, in particular that having the molecular weight of 110.000, denoted 

35 below by the abbreviation gp110. 

Additional characteristics of the invention will appear also in the course of the description which follows 
of the isolation of a virus according to the invention and of antigens, particularly an envelope antigen of the 
virus, reference will be made to the drawings in which : 

Figure 1 is derived from a photographic reproduction of gel strips which have been used to carry out 

40 electrophoreses of lysate extracts of T lymphocytes, respectively infected and uninfected (controls) by a 
LAV suspension. 

Figure 2 is the restriction map of a complete LAV genome (clone XJ19) 
Figures 3a to 3e are the complete sequence of a LAV viral genome. 

Figures 4 and 5 show diagrammatically parts of the three possible reading phases of LAV genomic 
45 RNA, including the open reading frames (ORF) apparent in each of said reading phases. 
Figure 6 is a schematic representation of the LAV long terminal repeat (LTR). 

I - PRODUCTION OF THE VIRUS AND OF ANTIGENS 

50 

T lymphocytes derived from a healthy donor and infected with LAV1 , under the conditions described by 
F. BARRE-SINOUSSI et Coll., on CEM cells derived from a patient afflicted with leukemia and also infected 
in vitro with LAV1, were kept under cultivation in a medium containing 200 microcuries of 35 S-cystein and 
devoid of unlabelled cystein. The infected lymphocytes were cultured in a non denaturating medium to 
55 prevent the degradation of the antigen sought The supernatant liquor from the culture medium was then 
subjected to a first centrifugation at 10,000 rpm for 10 minutes to remove the non viral components, then to 
a second centrifugation at 45,000 rpm for 20 minutes for sedimenting the virus, the virus pellet was then 
lysed by detergent in the presence of aprotinin (5 %) particularly under the conditions described in the 
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article of F. BARRE-SINOUSSI et Coll. 

The same operation was repeated on lymphocytes taken up from a healthy donor as control. 

The various lysates were then immuno-precipitated by serums of patients infected with AIDS or with 
LAS. Serums originating from healthy donors or of donors infected with other diseases were im- 
5 munoprecipitated too. The media were then subjected to electrophoreses in a SDS-polyacrylamide gel. 

The results are indicated in figure 1. The gel strips numbered from 1 to 6 were obtained from 
preparations labelled by 3S S-cystein. The strips numbered 7 to 10 show results observed on infected or 
uninfected lymphocyte preparations label led with 35 S-methionine. Finally the strip M corresponds to the 
migration distances of the standard proteins identified above, whose molecular weights are recalled in the 
io right hand portion of the figure. 

The references to the labelled viral proteins appear on the left handside of the figure. 

It is noted that columns 7 to 10 show the specific protein p25 of LAV, labelled with 35 S-methionin. The 
same protein is absent on strips 8 to 10 corresponding to results obtained with a preparation originating 
from healthy lymphocytes. 

75 Columns 3 and 5 correspond to the results which have been observed on preparations obtained from 
lymphocytes infected and labelled with 35 S-cystein. The proteins p25 and p18. the characteristic core 
proteins of LAV, and the glycoprotein gp110, also specific of LAV, were also present. Images corresponding 
to a protein p4l (molecular weight of the order of 41,000) appeared in the various preparations, although 
less distinctly. 

20 The virus according to the invention and the antigen according to the invention can be either 
precipitated by lectins, particularly concanavaline A. or fixed to a SEPHAROSE-concanavaline A column. 
Particularly the purification of the envelope glycoproteins can be carried out as follows. This fixation can 
particularly be carried out by contacting a lysate of the LAV virus dissolved in a suitable buffer with 
concanavaline-A bound to SEPHAROSE. A suitable buffer has the following composition : 

25 



30 



35 



Tris 


10 mM 


NaCI 


0.15 M 


CaCk 


1 mM 


MgCb 


1 mM 


Detergent marketed 


under the trade mark 


TRITON 1 % 


PH 


7.4 



When the fixation has been achieved, the SEPHAROSE-concanavaline A is washed with a buffer of the 
same composition, except that the TRITON concentration is lowered to 0.1 %. The elution is then effected 
40 w ith an 0.2 M O-methyl-a-D-mannopyranoside solution in the washing buffer. 

The protein may be further concentrated by immuno-precipitation with antibodies contained in the 
serums of patients infected with AIDS or with polyclonal antibodies obtained from a serum derived from an 
animal previously immunised against the "unaltered" virus according to the invention or the abovesaid 
glycoprotein. The protein can then be recovered by dissociation of the complex by a solution having an 
45 adequate content of ionic salt. Preferably the antibody preparation is itself immobilised in a manner known 
in itself on an insoluble support, for instance of the SEPHAROSE B type. 

It is also possible to resort to monoclonal antibodies secreted by hybridomas previously prepared 
against gp 110. These monoclonal antibodies, as well as the hybridomas which produce them, also form 
part of the invention. 

so A technique for producing and selecting monoclonal antibodies directed against the gp110 glycoprotein 
is described below. 



Immunisation of the mice 

Groups of Balb/c mice from 6 to 8 weeks old mice were used. One group receives the virus carrying 
the abovesaid glycoprotein, another a purified glycoprotein gp110. The immunisation procedure, identical 
for all mice, comprises injecting 10 mg of the antigenic preparation in the prsence of Freund complete 
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adjuvant at day 0, then again but in the presence of Freund incomplete adjuvant at day 14 and without 
adjuvant at days 28 and 42. The three first injections are made intraperitoneal^, the fourth intravenously. 



5 Fusion and culture of the hybrids 

The non secreting myeloma variant 5.53 P3 x 63 Ag8, resistant to azaguanine. itself derived from the 
MOPC-21 cell-line, is used. Fusion with immunised mouse splenocytes is carried out in the presence of 
polyethylene-glycol 4000 by the technique of FAZEKAS de st-GROTH and SCHEIDEGGER on the 45th 

70 day. The selection of the hybrids in RPMI 16-40 tt HAT" medium is carried, out in plates having 24 cups 
(known under the designation COSTAR) by resorting to the same culture techniques. 

The hybridomas producing antibodies of adequate specificity are then cloned in plates having 96 cups, 
in the presence of a "feeder" layer of syngenic thymocytes. The producing clones thus selected are then 
expanded in 24 cup plates, still in the presence of thymocytes. When the confluence appears in one of the 

75 cups, the clone is injected intraperitoneal^ into a balb/c mouse which bad received an injection of 
PRISTANE 8 days previously and/or kept in liquid culture. 

Demonstration of the anti-LAV antibodies 

20 

Five different techniques enable characterisation of the clones producing antibodies of suitable specific- 
ity. In a first stage, the hybrids producing antibodies are determined by an ELISA test revealing mouse 
immunoglobulins in the supernatant liquors. From this first selection, supernatants are sought which have 
antibodies directed against viral constituents by means of an ELISA test revealing anti-LAV antibodies, or by 
25 immunofluorescence on the virus producing human cells. Finally the supernatant liquours are analysed by 
radioimmunoprecipitation of virus labelled with cystein and by the Western-Blot technique on viral prepara- 
tion which permit the determination of the specificities of these anti-LAV antibodies. 



30 RESULTS 

Cells obtained from the various fusions are placed under culture in 648 cups. Their microscopic 
examination shows that the majority of these cups contain a single hybrid clone capable of growing in a 
"HAT" selective medium. More than 50 % among them produce antibodies giving rise to a positive 

35 response under ELISA antivirus examination. The most representative fusions are tested by the Western- 
Blot technique and several of them are subcloned, taking into account their respective specificities 
reactivities in antivirus ELISA and their behaviours under the culturing conditions. Those hybrids which are 
more particularly selected are those which produce antibodies which selectively recognise the viral 
glycoprotein gp110 having a molecular weight of about 110 KD. All the sub clonings give rise to' clones 

40 producing antibodies which, after expression, are injected into syngenic mice. Analysis of the specificities of 
the antibodies present in the different ascites liquids confirm the specificity of the antibodies of said ascites 
with respect to gp110. 

The monoclonal antibodies obtained can themselves be employed to purify proteins containing an 
antigenic site also contained in gpl10. The invention relates therefore also to these processes of purification 

45 as such. This process is advantageously applied to virus lysates or T lymphocyte lysates or other cells 
producing LAV or the like, when care has been taken to avoid the uncontrolled separation of gp110 during 
the purification procedure of the virus, prior to lysys thereof. Needless to say that the process can also be 
applied to any solution containing gpllO or a protein, polypeptide or glycoprotein comprising an antigenic 
site normally carried by the envelope protein and recognised by the monoclonal antibody. For practising 

so this process, the monoclonal antibodies are advantageously immobilised on a solid support, preferably 
adapted to affinity chromatography operations. For example, these monoclonal antibodies are fixed to an 
agarose lattice with three-dimensional cross-linking, marketed under the trade mark SEPHAROSE by the 
Swedish company PHARMACIA A.G., for example by the cyanogen bromide method. 

The invention therefore also relates to a process for separating the antigens concerned, which process 

55 comprises contacting a mixture of antigens, including those of interest (for instance a virus lysate or 
extract), with an affinity column bearing the abovesaid monoclonal antibodies, to selectively fix polypep- 
tides, proteins or glycoproteins selectively recognized by said monoclonal antibodies, recovering the latter 
by dissociation of the antigen-antibody complex by means of a suitable buffer, particularly a solution of 

7 
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adequate ionic strength, for example of a salt, preferably ammonium acetate (which leaves no residue upon 
. freeze drying of the preparation or a solution acidified to a pH 2-4 or to a glycine buffer at the same pH and 
recovering the eluted polypeptides, proteins or glycoproteins. 

It is self-evident that the invention relates also to polypeptide fragments having lower molecular weights 

5 and carrying antigenic sites recognizable by the same monoclonal antibodies. It is clear to the specialist 
that the availabililty of monoclonal antibodies recognizing the gp110 glycoprotein gives also access to 
smaller peptide sequences or fragments containing the common antigenic site or epitope. Fragments of 
smaller sizes may be obtained by resorting to known techniques. For instance such a method comprises 
cleaving the original larger polypeptide by enzymes capable of cleaving it at specific sites. By way of 

io examples of such proteins, may be mentioned the enzyme of Staphylococcyus aureus V8, a- 
chymotrypsine, "mouse sub-maxi!lary gland protease" marketed by the BOEHRINGER company, Vibrio 
alginolyticus chemovar iophagus collagenase which specifically recognises said peptides Gly-Pro and Gly- 
Ala, etc.. 

It is also possible to obtain polypeptides or fragments of envelope antigens of the virus, by cloning 
rs fragments excised from a cDNA constructed from genoimes of LAV variants. 

Figures 2 and 3 are restriction maps of such a cDNA comprising a total of 9.1 to 9.2 kb. The 
polypeptides coded by cDNA fragments located in the region extending between site Kpnl (position 6100) 
and site Bglll (position 9150) of the restriction map of Figure 2. The presence of a characteristic site of an 
envelope antigen of the LAV virus or the like in any polypeptide expressed (in a suitable host cell 
20 transformed behorehand by a corresponding fragment or by a vector containing said fragment) can be 
detected by any suitable immunochemical means. 

Particularly the invention relates more particularly to polypeptides encoded by cDNA fragments defined 
hereafter. It also relates to the nucleic acid fragments themselves, including a cDNA variant corresponding 
to a whole LAV retroviral genome, characterized by a series of restriction sites in the order hereafter (from 
2S the 5 # end to the 3' end). 

The coordinates of the successive sites of the whole LAV genome (see also restriction map of XJ19 in 
fig. 1) are indicated hereafter too, with respect to the Hind III site (selected as of coordinate 1) which is 
located in the R region. The coordinates are estimated with an accuracy of ± 200 bp : 



40 



45 



SO 



Hind III 


0 


Sac I 


50 


Hind III 


520 


Pst I 


800 


Hind III 


1 100 


Bgl II 


1 500 


Kpn I 


3 500 


Kpn I 


3 900 


Eco Rl 


4 100 


Eco Rl 


5 300 


Sail 


5 500 


Kpn I 


6 100 


Bgl II 


6 500 


Bgl II 


7 600 


Hind III 


7 850 


Bam HI 


8 150 


Xhol 


8 600 


Kpn I 


8 700 


Bgl II 


8 750 


Bgl II 


9 150 


Sac I 


9 200 


Hind III 


9 250 



55 Another DNA variant according to this invention optionally contains an additional Hind III approximately 
at the 5 550 coordinate. 

Reference is further made to fig. 1 which shows a more detailed restriction map of said whole-DNA 
(XJ19). 
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An even more detailed nucleotidic sequence of a preferred DNA according to the invention is shown in 
figs. 4a-4e hereafter. 

The invention further relates to other preferred DNA fragments and polypeptide sequences 
(glycosylated or not glycosylated) which will be referred to hereafter. 

SEQUENCING OF LAV 

The sequencing and determination of sites of particular interest were carried out on a phage 
recombinant corresponding to XJ19 disclosed in the abovesaid British Patent application Nr. 84 23659. A 
method for preparing it is disclosed in that application. 

The whole recombinant phage DNA of clone XJ19 (disclosed in the earlier application) was sonicated 
according to the protocol of DEININGER (1983), Analytical Biochem. 129, 216. the DNA was repaired by a 
Klenow reaction for 12 hours at 16* C. The DNA was electrophoresed through 0.8 % agarose gel and DNA 
in the size range of 300-600 bp *vas cut out and electroeluted and precipitated. Resuspended DNA (in 10 
mM Tris, pH 8 ; 0,1 mM EDTA) was ligated into M13mp8 RF DNA (cut by the restriction enzyme Smal and 
subsequently alkaline phosphated), using T4 DNA-and RNA-ligases (Maniatis T et al (1982) - Molecular 
cloning - Cold Spring Harbor Laboratory). An E. coli strain designated as TG1 was used for further study. 
This strain has the following genotype : 
Alac pro, supE, thi.F'traD36. proAB, lacl q , 2AM15,r" 

This E. coli TGI strain has the peculiarity of enabling recombinants to be recognized easily. The blue 
colour of the cells transfected with plasmids which did not recombine with a fragment of LAV DNA is not 
modified. To the contrary cells transfected by a recombinant plasmid containing a LAV DNA fragment yield 
white colonies. The technique which was used is disclosed in Gene (1983), 26, 101. 

This strain was transformed with the ligation mix using the Hanahan method (Hanahan D (1983) J. Mol. 
Biol. 166, 557). Cells were plated out on tryptone-agarose plate with IPTG and X-gal in soft agarose. White 
plaques were either picked and screened or screened directly in situ using nitrocellulose filters. Their DNAs 
were hybridized with nick-translated DNA inserts of pUCl8 Hind III subclones of XJ19. This permitted the 
isolation of the plasmids or subclones of X which are identified in the table hereafter. In relation to this table 
it should also be noted that the designation of each plasmid is followed by the deposition number of a cell 
culture of E. coli TGI containing the corresponding plasmid at the "Collection Nationale des Cultures de 
Micro-organismes" (C.N.C.M.) of the Pasteur Institute in Paris. France. A non-transformed TGI cell line was 
also deposited at the C.N.C.M. under Nr. I-364. All these deposits took place on November 15, 1984. The 
sizes of the corresponding inserts derived from the LAV genome have also been indicated. 
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TABLE 



Essentia! features of the recombinant 
piasmids 


-pJ19-1 piasmid (I-365) 


0.5 kb 


Hind III - Sac I - Hind III 


- PJ19 - 17 piasmid (1-367) 


0.6 kb 


Hind ill - Pst 1 - Hind III 


-pJ19- 6 piasmid (I-366) 

Hind 111 (5') 

Bam HI 

Xho I 

Kpn I 

Bglll 

Sac J (3 ) 

Hind III 

- pJ19 - 13 piasmid (I-368) 

Hind III (S') 

Bgl II 

Kpn I 

Kpn I 

Eco Rl 

Eco Rl 

Sail 

Kpn I 

Bgl II 

Bgl II 

Hind III (3 ) 


1.5 kb 
6.7 kb 



Positively hybridizing M13 phage plates were grown up for 5 hours and the single-stranded DNAs were 
extracted. 

Ml3mp8 subclones of XJ19 DNAs were sequenced according to the dideoxy method and technology 
devised by Sanger et al (Sanger et al (1977), Proc. Natl. Acad. Sci. USA, 74 , 5463 and M13 cloning and 
sequencing handbook, AMERSHAM (1983). the 17-mer oligonucleotide primer o- 35 SdATP (400Ci/mmol, 
AMERSHAM), and 0.5X-5X buffer gradient gels (Biggen M.D. et al (1983. Proc. Natl. Acad. Sci. USA, 50, 
3963) were used. Gels were read and put into the computer under the programs of Staden (Staden R. 
(1982), Nucl. Acids Res. 10, 4731). All the appropriate references and methods can be found in the 
AMERSHAM M13 cloning and sequencing handbook. 

The complete DNA sequence of XJ19 (also designated as LAV-la) is shown in figs. 4 to 4e. 

The sequence was reconstructed from the sequence of phage XJ19 insert. The numbering starts at the 
cap site which was located experimentally (see hereafter). Important genetic elements, major open reading 
frames and their predicted products are indicated together with the Hindlll cloning sites. The potential 
glycosylation sites in the env gene are overlined. The NH 2 -terminal sequence of p25° a8 determined by 
protein microsequencing is boxed. 

Each nucleotide was sequenced on average 5.3 times : 85 % of the sequence was determined on both 
strands and the remainder sequenced at least twice from independent clones. The base composition is T, 
22.2 % ; C, 17.8 % ; A, 35.8 % ; G, 244.2 % ; G + C, 42 %. The dinucleotide GC is greatly under 
represented (0,9 %) as common amongst eukaryotic sequences (Bird 1980). 

Figs. 5 and 6 provide a diagrammatized representation of the lengths of the successive open reading 
frames corresponding to the successive reading phases (also referred to by numbers "1", "2" and m 3 m 
appearing in the left handside part of fig. 5). The relative positions of these open reading frames (ORF) with 
respect to the nucleotidic structure of the LAV genome is referred to by the scale of numbers representative 
of the respective positions of the corresponding nucleotides in the DNA sequence. The vertical bars 
correspond to the positions of the corresponding stop codons. 
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The following genes and DNA fragments can be distinguished on the different reading frames shown. 
Reference is then also made to the proteins or glycoproteins encoded by said genes and fragments. 



5 The envelope gene (or ORF-env) 

env : The env open reading frame has a possible initiator methionine codon very near the beginning 
(8th "triplet). If so the molecular weight of the presumed env precursor protein (861 aminoacids. MWcalc = 
97376) is consistent with the size of the LAV glycoprotein (110 kd and 90 kd after glycosidase treatment). 
io There are 32 potential N-giycosylation sites (Asn-X-Ser/Thr) which are overlined in Fig. 4d and 4e. An 
interesting feature of env is the very high number of Trp residues at both ends of the protein. 

The DNA sequence thought to code for envelope proteins is thought to extend from nucleotidic position 
5746 (starting with 5 AAA GAG GAG A....3') up to nucleotidic position 8208 (ending by .....A ACT AAA GAA 
3'). Polypeptidic structures of sequences of the envelope protein correspond to those read according to the 
;5 "phase 3" reading phase. 

The start of env transcription is thought to be at the level of the ATG codon at position 5767-5769. 

There are three hydrophobic regions, characteristic of the retroviral enveLope proteins (Seiki et al., 

1983) corresponding to a signal peptide (encoded by nucleotides 5815-5850 bp), a second region (7315- 
7350 bp) and a transmembrane segment (7831-7890 bp). The second hydrophobic region (7315-7350 bp) is 

20 preceeded by a stretch rich in Arg + Lys. It is possible that this represents a site of proteolytic cleavage, 
which by analogy with over retroviral proteins, would give an external envelope polypeptide and .a 
membrane associated protein (Seiki et al.. 1983. Kiyokawa et al., 1984). A striking feature of the LAV 
envelope protein sequence is that the segment following the transmembrane segment is of unusual length 
(150 residues). The env protein shows no homology to any sequence in protein data banks. The small 

25 aminoacid motif common to the transmembrane proteins of all leukemogenic retroviruses (Cianciolo et al., 

1984) is not present in lav env. 

The invention concerns more particularly the DNA sequence 5567. up to nucleotide 8208 or even 8350 
deemed to encode the gp 110 (envelope glycoprotein of the LAV virus which has a molecular weight of 
about 110.000 daltons) beginning at about nucleotide as well as the polypeptidic backbone of the 
30 glycoprotein sequence which corresponds to that having an approximate molecular weight of 90.000 daltons 
and which is obtained by glycosidase treatment of gp110. 

The invention further relates to the purified polypeptides which have the aminoacid structure (or 
polypeptidic backbone) of the gp110 and gp90 respectively, which correspond to the direct translation of 
the DNA sequences and fragments which have been defined more specifically hereabove (figs 4d and 4e). 
35 The invention further relates to polypeptides containing neutralizing epitopes. 

The locations of neutralizing epitopes are further apparent in fig. 4d. Reference is more particularly 
made to the overlined groups of three letters included in the aminoacid sequences of the envelope proteins 
(reading phase 3) which can be designated generally by the formula Asn-X-Ser or Asn-X-Thr. wherein X is 
any other possible aminoacid. Thus the initial protein product or polypeptide backbone of the env 
40 glycoprotein has a molecular weight in excess of 91,000. These groups are deemed to generally carry 
glycosylated groups. These Asn-X-Ser and Asn-X-Thr groups with attached glycosylated groups form 
together hydrophylic regions of the protein and are deemed to be located at the periphery of and to be 
exposed outwardly with respect to the normal conformation of the proteins. Consequently they are 
considered as being epitopes which can efficiently be brought into play in vaccine compositions. 
45 The invention thus concerns with more particularity peptide sequences included in the env-proteins and 
excizable therefrom (or having the same aminoacid structure), having sizes not exceeding 200 aminoacids. 

Preferred peptides of this invention (referred to hereafter as a, b, c. d, e, f) are deemed to correspond 
to those encoded by the nucleotide sequences which extend respectively between the following positions : 
a) from about 6171 to about 6276 
50 b) from about 6336 to about 6386 

c) from about 6466 to about 6516 

d) from about 6561 to about 6696 

e) from about 6936 to about 7006 

f) from about 761 1 to about 7746 

55 Other hydrophilic peptides in the env open reading frame are identified hereafter, they are defined 
starting from aminoacid 1 = lysine coded by the AAA at position 5746-5748 in4he LAV DNA sequence (figs 
4d and 4e) and then further numbered in accordance with their order with respect to the end sequence. The 
first and second numbers in relation to each peptide refer to their respective N-terminal and C-terminal 
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aminoacids. 

These hydrophilic peptides are : 
aminoacids 8-23 inclusive, i.e. Met-Arg-Val-Lys-Glu-Lys-Tyr-Gln-His-Leu-Trp-Arg-Trp-Gly-Trp-Lys- 
aminoacids 63-78 inclusive, i.e. Ser-Asp-Ala-Lys-AIa-Tyr-Asp-Thr-Glu-Val-His-Asn-Va!-Trp-A!a-Thr- 
5 aminoacids 82-90 inclusive, i.e. Va I- Pro-Thr- Asp-Pro- Asn-Pro-Gln-Glu- 

aminoacids 97-123 inclusive, i.e. Thr-Glu-Asn-Phe-Asn-Met-Trp-Lys-Asn-Asp-Met-Val-Glu-G!n-Met-His-Glu- 
Asp-lle-lle-Ser-Leu-Trp-Asp-Gln-Ser-Leu- 

aminocids 127-183 inclusive, i.e. Val-Lys-Leu-Thr-Pro-Leu-Cys-Val-Ser-Leu-Lys-Cys-Thr-Asp-Leu-Gly-Asn- 

Ala-Thr-Asn-Thr-Asn-Ser-Ser-Asn-Thr-Asn-Ser-Ser-Se^^ 
70 Cys-Ser-Phe-Asn-lle-Ser-Thr-Ser-lle-Ara-Gly-Lys-Val-Gln-Lys- 

aminoacids 197-201 inclusive, i.e. L( j-Asp-lle-lle-Pro-lle-Asp-Asn-Asp-Thr-Thr- 
aminocids 239-294 inclusive, i.e. Lys-Cys-Asn-Asn-Lys-Thr-Phe-Asn-Gly-Thr-^ 

Thr-Val-Gln-Cys-Thr-His-Gly-lle-Arg-Pro-W^ 

Val-VaMIe-Arg-Ser-Ala-Asn-Phe-Thr-Asp-Asn-Ala-Lys- 
75 aminocids 300-327 inclusivej.e. Leu-Asn-Gln-Ser-Val-Glu-Ile-Asn-Cys-Thr-Arg-Pro-Asn-Asn-Asn-Thr-Arg- 

Lys-Ser-lle-Arg-lle-Gln-Arg-Gly-Pro-Gly-Arg- 

aminoacids 334-381 inclusive i.e. Lys-lle-Gly-Asn-Met-Arg-Gln-Ala-His-Cys-Asn-lle^ 

Ala-Thr-Leu-Lys-Gln-lle-Ala-Ser-Lys-Leu-Arg-Glu-Gln-Phe-Gly-Asn-Asn-Lys-Thr-IIe-lle-Phe-Lys-Glu-G 

Ser-Gly-Gly-Asp-Pro- 

20 aminoacids 397-424 inclusive, i.e. Cys-Asn-Ser-Thr-Gln-Leu-Phe-Asn-Ser-Thr-Trp-Phe-Asn-Ser-Thr-Trp-Ser- 
Thr-Glu-Gly-Ser-Asn-Asn-Thr-Glu-Gly-3er-Asp- 

aminoacids 466-500 inclusive, i.e. Le-i-Thr-Arg-Asp-Gly-Gly-Asn-Asn-Asn-Asn-Gly-Ser-Glu-lle-Phe-Arg-Pro- 
Gly-Gly-Gly-Asp-Met-Arg-Asp-Asn-Trp-Arg-Ser-Glu-Leu-Tyr-Lys-Tyr-Lys-Val- 
aminoacids 510-523 inclusive, i.e. Pro-Thr-Lys-Ala-Lys-Arg-Arg-Val-Val-Gln-Arg-Glu-Lys-Arg- 
26 aminoacids 551-577 inclusive, i.e. Val-Gln-Ala-Arg-Gln-Leu-Leu-Ser-Gly-lle-Val-Gln-Gln-Gln-Asn-Asn-Leu- 
Leu-Arg-Ala-lle-Glu-Ala-Gln-Gln-His-Leu- 

aminoacids 594-603 inclusive, i.e. Ala-Val-Glu-Arg-Tyr-Leu-Lys-Asp-Gln-GIn- 
aminoacids 621-630 inclusive, i.e. Pro-Trp-Asn-Aia-Ser-Trp-Ser-Asn-Lys-Ser- 

aminoacids 657-679 inclusive, i.e. Leu-IIe-Glu-Glu-Ser-Gln-Asn-Gln-Gln-Glu-Lys-Asn-Glu-Gln-Glu-Leu-Leu- 
30 Glu-Leu-Asp-Lys-Trp-Ala- 

aminoacids 719-758 inclusive, i.e. Arg-Vai-Arg-Gln-Gly-Tyr-Ser-Pro-Leu-Ser-Phe-Gln-Thr-His-Leu-Pro-Thr- 
Pro-Arg-Gly-Pro-Asp-Arg-Pro-Glu-Gly-lle-Glu-Glu-Glu-Gly-Gly-Glu-Arg-Asp-Arg-Asp-Arg-Ser-lle- 
aminoacids 780-803 inclusive, i.e. Tyr-His-Arg-Leu-Arg-Asp-Leu-Leu-Leu-lte-Val-Thr-Arg-Me-Val-Glu-Leu- 
Leu-Gly-Arg-Arg-Gly-Trp-Giu- 

35 

Table 1 





orf 


1st 


Met 


stop 


No amino 


MWcalc 


40 




triplet 






acids 




gag 


312 


336 


1836 


500 


55841 




pol 


1631 


1934 


4640 


(1003) 


(113629) 




orfQ 


4554 


4587 


5163 


192 


22487 




env 


5746 


5767 


8350 


861 


97376 


45 


orf F 


8324 


8354 


8972 


206 


23316 




Location and sizes of viral open reading frames. 



so The invention concerns more particularly all the DNA fragments which have been more specifically 
referred to hereabove and which correspond to open reading frames. It will be understood that the man 
skilled in the art will be able to obtain them all, for instance by cleaving an entire DNA corresponding to the 
complete genome of a LAV species, such as by cleavage by a partial or complete digestion thereof with a 
suitable restriction enzyme and by the subseguent recovery of the relevant fragments. The different DNAs 

55 disclosed above can be resorted to also as a source of suitable fragments. The techniques disclosed 
hereafter for the isolation of the fragments which were then included in the plasmids referred to hereabove 
and which were then used for the DNA sequencing can be used. 

Of course other methods can be used. Some of them have been examplified in British Application Nr. 
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8423659 filed on September 19, 1984. Reference is for instance made to the following methods. 

a) DNA can be transfected into mammalian cells with appropriate selection markers by a variety of 
techniques, calcium phosphate precipitation, polyethylene glycol, protoplast-fusion, etc.. 

b) DNA fragments correspondina to genes can be cloned into expression vectors for E. coli , yeast- 
or mammalian cells and the resultant proteins purified. 

c) The provival DNA can be "shot-gunned" (fragmented) into procaryotic expression vectors to 
generate fusion polypeptides. Recombinant producing antigenically competent fusion proteins can be 
identified by simply screening the recombinants with antibodies against LAV antigens . 

The invention further refers more specifically to DNA recombinants, particularly modified vectors, 
including any of the preceding DNA sequences and adapted to transform corresponding microorganisms or 
cells, particularly eucaryotic cells such as yeasts, for instance saccharomyces cerevisiae, or higher 
eucaryotic cells, particularly cellr of mammals, and to permit expression of said DNA sequences in the 
corresponding microorganisms or cells. 

More particularly the invention relates to such modified DNA recombinant vectors modified by the 
abovesaid DNA sequences and which are capable of transforming higher eucaryotic cells particularly 
mammalian cells. Preferably any of the abovesaid sequences are placed under the direct control of a 
promoter contained in said vectors and which is recognized by the polymerases of said cells, such that the 
first nucleotide codons expressed correspond to the first triplets of the above-defined DNA-sequences. 
Accordingly this invention also relates to the corresponding DNA fragments which can be obtained from 
LAV genomas or corresponding cDNAs by any appropriate method. For instance such a method comprises 
cleaving said LAV genomas or cDNAs by restriction enzymes preferably at the level of restriction sites 
surrounding said fragments and close to the opposite extremities respectively thereof, recovering and 
identifying the fragments sought according to sizes, if need be checking their restriction maps or nucleotide 
sequences (or by reaction with monoclonal antibodies specifically directed against epitopes carried by the 
polypeptides encoded by said DNA fragments), and further if need be, trimming the extremities of the 
fragment, for instance by an exonucleolytic enzyme such as Bal31. for the purpose of controlling the 
desired nucleotide-sequences of the extremities of said DNA fragments or, conversely, repairing said 
extremities with Klenow enzyme and possibly ligating the latter to synthetic polynucleotide fragments 
designed to permit the reconstitution of the nucleotide extremities of said fragments. Those fragments may 
then be inserted in any of said vectors for causing the expression of the corresponding polypeptide by the 
cell transformed therewith. The corresponding polypeptide can then be recovered from the transformed 
cells, if need be after lysis thereof, and purified, by methods such as electrophoresis. Needless to say that 
all conventionnal methods for performing these operations can be resorted to. 

The invention also relates more specifically to cloned probes which can be made starting from any DNA 
fragment according to this invention, thus to recombinant DNAs containing such fragments, particularly any 
plasmids amplifiable in procaryotic or eucaryotic cells and carrying said fragments. 

Using the cloned DNA fragments as a molecular hybridization probe - either by labelling with 
radionucleotides or with fluorescent reagents - LAV virion RNA may be detected directly in the blood, body 
fluids and blood products (eg of the antihemophylic factors such as Factor VIII concentrates) and vaccines, 
i.e. hepatitis B vaccine. It has already been shown that whole virus can be detected in culture supernatants 
of LAV producing cells. A suitable method for achieving that detection comprises immobilizing virus onto a 
support, e.g. nitrocellulose filters, etc., disrupting the virion and hybridizing with labelled (radiolabeled or 
"cold" fluorescent- or enzyme-labelled) probes. Such an approach has already been developed for Hepatitis 
B virus in peripheral blood (according to SCOTTO J. et al. Hepatology (1983). 3. 379-384). 

Probes according to the invention can also be used for rapid screening of genomic DNA derived from 
the tissue of patients with LAV related symptoms, to see if the proviral DNA or RNA is present in host 
tissue and other tissues. 

A method which can be used for such screening comprises the following steps : extraction of DNA from 
tissue, restriction enzyme cleavage of said DNA, electrophoresis of the fragments and Southern blotting of 
genomic DNA from tissues, subseguent hybridization with labelled cloned LAV provival DNA. Hybridization 
in situ can also be used. 

Lymphatic fluids and tissues and other non-lymphatic tissues of humans, primates and other mamma- 
lian species can also be screened to see if other evolutionnary related retrovirus exist. The methods 
referred to hereabove can be used, although hybridization and washings would be done under non stringent 
conditions. 

The DNAs or DNA fragments according to the invention can be used also for achieving the expression 
of LAV viral antigens for diagnostic purposes. 

The invention relates generally to the polypeptides themselves, whether synthesized chemically isolated 
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from viral preparation or expressed by the different DNAs of the inventions, particularly by the ORFs or 
fragments thereof, in appropriate hosts, particularly procaryotic or eucaryotic hosts, after transformation 
thereof with a suitable vector previously modified by the corresonding DNAs. 

More generally, the invention also relates to any of the polypeptide fragments (or molecules, particularly 

5 glycoproteins having the same polypeptidic backbone as the polypeptides mentioned hereabove) bearing 
an epitope characteristic of a LAV protein or glycoprotein, which polypeptide or molecule then has N- 
terminal and C-terminal extremities respectively either free or, independently from each other, covalently 
bond to aminoacids other than those which are normally associated with them in the larger polypeptides or 
glycoproteins of the LAV virus, which last mentioned aminoacids are then free or belong to another 

jo polypeptidic sequence. Particularly the invention relates to hybrid polypeptides containing any of the 
epitope-bearing-polypeptides which h^ve been defined more specifically hereabove, recombined with other 
polypeptides fragments normally foreign to the LAV proteins, having sizes sufficient to provide for an 
increased immunogenicity of the epitope-bearing-polypeptide yet, said foreign polypeptide fragments either 
being immunogenically inert or not interfering with the immunogenic properties of the epitope-bearing- 

75 polypeptide. 

Such hybrid polypeptides which may contain up to 150, even 250 aminoacids usually consist of the 
expression products of a vector which contained ab initio a nucleic acid sequence expressible under the 
control of a suitable promoter or replicon in a suitable host, which nucleic acid seguence had however 
beforehand been modified by insertion therein of a DNA sequence encoding said epitope-bearing- 
20 polypeptide. 

Said epitope-bearing-polypeptides, particularly those whose N-terminal and C-terminal aminoacids are . 
free, are also accessible by chemical synthesis, according to technics well known in the chemistry of 
proteins. 

The synthesis of peptides in homogeneous solution and in solid phase is well known. 

25 In this respect, recourse may be had to the method of synthesis in homogeneous solution described by 
Houbenweyl in the work entitled "Methoden der Organischen Chemie" (Methods of Organic Chemistry) 
edited by E. WUNSCK, vol. 15-1 and II. THIEME. Stuttgart 1974. 

This method of synthesis consists of successively condensing either the successive aminoacids in 
twos, in the appropriate order or successive peptide fragments previously available or formed and 

30 containing already several aminoacyl residues in the appropriate order respectively. Except for the carboxyl 
and aminogroups which will be engaged in the formation of the peptide bonds, care must be taken to 
protect beforehand all other reactive groups borne by these aminoacyl groups or fragments. However, prior 
to the formation of the peptide bonds, the carboxyl groups are advantageously activated, according to 
methods well known in the synthesis of peptides. Alternatively, recourse may be had to coupling reactions 

35 bringing into play conventional coupling reagents, for instance of the carbodiimide type, such as 1-ethyl-3- 
(3-dimethyl-aminopropyI)-carbodiimide. When the aminoacid group used carries an additional amine group 
(e.g. lysine) or another acid function (e.g. glutamic acid), these groups may be protected by carbobenzoxy 
or t-butyloxycarbonyl groups, as regards the amine groups, or by t-butylester groups, as regards the 
carboxylic groups. Similar procedures are available for the protection of other reactive groups. For example, 

40 SH group (e.g. in cysteine) can be protected by an acetamidomethyl or paramethoxybenzyl group. 

In the case of progressive synthesis, aminoacid by aminoacid, the synthesis starts preferably by the 
condensation of the C-terminal aminoacid with the aminoacid which corresponds to the neighboring 
aminoacyl group in the desired seguence and so on, step by step, up to the N-terminal aminoacid. Another 
preferred technique can be relied upon is that described by R.D. Merrifteld in "solid phase peptide 

45 synthesis" (J. Am. Chem. Soc. 45. 2149-2154). 

In accordance with the Merrifield process, the first C-terminal aminoacid of the chain is fixed to a 
suitable porous polymeric resin, by means of its carboxylic group, the amino group of said aminoacid then 
being protected, for example by a t-butyloxycarbonyl group. 

When the first C-terminal aminoacid is thus fixed to the resin, the protective group of the amine group 

so is removed by washing the resin with an acid, i.e. trifluoroacetic acid, when the protective group of the 
amine group is a t-butyloxycarbonyl group. 

Then the carboxylic group of the second aminoacid which is to provide the second aminoacyl group of 
the desired peptidic sequence, is coupled to the deprotected amine group of the C-terminal aminoacid fixed 
to the resin. Preferably, the carboxyl group of this second aminoacid has been activated, for example by 

55 dicyclohexylcarbodiimide while its amine group has been protected, for example by a t-butyloxycarbonyl 
group. The first part of the desired peptide chain, which comprising the first two aminoacids, is thus 
obtained. As previously, the amine group is then deprotected, and one can furtherj)roceed with the fixing of 
the next aminoacyl group and so forth until the whole peptide sought is obtained. 
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The protective groups of the different side groups, if any, of the peptide chain so formed can then be 
removed. The peptide sought can then be detached from the resin, for example, by means of hydrofluoric 
acid, and finally recovered in pure form from the acid solution according to conventional procedures. 

As regards the peptide seguences of smallest size and bearing an epitope or imunogenic determinant, 
s and more particularly those which are readily accessible by chemical synthesis, it mav be reguired, in order 
to increase their in vivo immunogenic character, to couple or "conjugate" them covalently to a physiologi- 
cally acceptable and non toxic carrier molecule. 

By way of examples of carrier molecules or macromolecular supports which can be used for making the 
conjugates according to the invention, will be mentioned natural proteins, such as tetanic toxoid, ovalbumin, 
io serum-albumins, hemocyanins, etc.. Synthetic macromolecular carriers, for example polysines or poly(D-L- 
alanine)-poly(L-tysine)s, can be u„ed too. 

Other types of macromolecular carriers which can be used, which generally have molecular weights 
• higher than 20,000, are known from the literature. 

The conjugates can be synthesized by known processes, such as described by Frantz and Robertson 
is in "Infect and Immunity-, 33, 193-198 (1981), or by P.E. Kauffman in purified and Enviromental Microbi- 
ology. October 1981, Vol. 42, n # 4, 611-614. 

For instance the following coupling agents can be used : glutaric aldehyde, ethyl chloroformate, water- 
soluble carbodiimides (N-ethyl-N'(3-dimethylaminopropyl) carbodiimide, HCI), diisocyanates, bis-diazoben- 
zidine, di- and trichloro-s-triazines, cyanogen bromides, benzaquinone, as well as coupling agents men- 
20 tioned in "Scand. J. Immunol., 1978. vol. 8, p. 7-23 (Avrameas, Ternynck, Guesdon). 

Any coupling process can be used for bonding one or several reactive groups of the peptide, on the 
one hand, and one or several reactive groups of the carrier, on the other hand. Again coupling is 
advantageously achieved between carboxyl and amine groups carriefd by the peptide and the carrier or 
vice-versa in the presence of a coupling agent of the type used in protein synthesis, i.e. 1 -ethyl-3-(3- 
25 dimethylaminopropyl)-carbodiimide, N-hydroxybenzotriazole, etc.. Coupling between amine groups respec- 
tively borne by the peptide and the carrier can also be made with glutaraldehyde, for instance, according to 
the method described by BOQUET, P. et al. (1982) Molec. Immunol., 19, 1441-1549, when the carrier is 
hemocyanin. 

The immunogenicity of epitope-bearing-peptides can also be reinforced, by oligomerisation thereof, for 
30 example in the presence of glutaraldehyde or any other suitable coupling agent. In particular, the invention 
relates to the water soluble immunogenic oligomers thus obtained, comprising particularly from 2 to 10 
monomer units. 

The glycoproteins, proteins and polypeptides (generally designated hereafter as "antigens" of this 
invention, whether obtained in a purified state from LAV virus preparations or by DNA- recombinant 
35 technology, are useful in processes for the detection of the presence of anti-LAV antibodies In biological 
media, particularly biological fluids such as sera from man or animal, particularly with a view of possibly 
diagnosing LAS or AIDS. 

Particularly the invention relates to an in vitro process of diagnosis making use of an envelope 
glycoprotein (or of a polypeptide bearing an epitope of this glycoprotein) for the detection of anti-LAV 
40 antibodies in the serums of persons who carry them. Other polypeptides - particular those carrying an 
epitope of a core protein - can be used too. 

A preferred embodiment of the process of the invention comprises : 

- depositing a predetermined amount of one or several of said antigens in the cups of a titration microplate; 

- introducing of increasing dilutions of the biological fluid, i.e. serum to be diagnosed into these cups : 
45 - incubating the microplate 

- washing carefully the microplate with an appropriate buffer : 

- adding into the cups specific labelled antibodies directed against blood immunoglobulins and 

- detecting the antigen-antibody-complex formed which is then indicative of the presence of LAV antibodies 
in the biological fluid. 

so Advantageously the labelling of the anti-immunoglobulin antibodies is achieved by an enzyme selected 
from among those which are capable of hydrolysing a substrate, which substrate undergoes a modification 
of its radiation-absorption, at least within a predetermined band of wavelenghts. The detection of the 
substrate, preferably comparatively with respect to a control, then provides a measurement of the potential 
risks or of the effective presence of the disease. 

55 Thus preferred methods immuno-enzymatic or also immunofluorescent detections in particular accord- 
ing to the ELISA technique. Titrations may be determinations by immunofluorescence or direct or indirect 
immuno-enzymatic determinations. Quantitative titrations of antibodies on the serums studied can be made. 
The invention also relates to the diagnostic kits themselves for the in vitro detection of antibodies 
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against the LAV virus, which kits comprise any of the polypeptides identified herein, and all the biological 
and chemical reagents, as well as equipment necessary for peforming diagnostic assays. Preferred kits 
comprise all reagents reguired for carrying out ELISA assays. Thus preferred kits will include, in addition to 
any of said polypeptides, suitable buffers and anti-human immunoglobulins, which anti-human im- 
5 munogiobulins are labelled either by an immunofiuorescent molecule or by an enzyme. In the last instance 
preferred kits then also comprise a substrate hydrolysable by the enzyme and providing a signal, 
particularly modified absorption of a radiation, at least in a determined wavelength, which signal is then 
indicative of the presence of antibody in the biological fluid to be assayed with said kit. 

The invention also relates to vaccine compositions whose active principle is to be constituted by any of 
70 the antigens, i.e. the hereabove disclosed polypeptides whole antigens, particularly the purified gp110 or 
immunogenic fragments thereof, fusion polypeptides or oligopeptides in association with a suitable phar- 
maceutical or physiologically acceptable carrier. 

A first type of preferred active principle is the gp110 immunogen. 

Other preferred active principles to be considered in that fields consist of the peptides containing less 

75 than 250 aminoacid units, preferably less than 150, as deducible for the complete genomas of LAV, and 
even more preferably those peptides which contain one or more groups selected from Asn-X-Ser and Asn- 
X-Ser as defined above. Preferred peptides for use in the production of vaccinating principles are peptides 
(a) to (f) as defined above. By wav of example having no limitative character, there may be mentioned that 
suitable dosages of the vaccine compositions are those which are effective to elicit antibodies in vivo, in the 

20 host particularly a human host. Suitable doses range from 10 to 500 micrograms of polypeptide, protein or 
glycoprotein per kg, for instance 50 to 100 micrograms per kg. 

The different peptides according to this invention can also be used themselves for the production of 
antibodies, preferably monoclonal antibodies specific of the different peptides respectively. For the produc- 
tion of hybridomas secreting said monoclonal antibodies, conventional production and screening methods 

25 are used. These monoclonal antibodies, which themselves are part of the invention then provide very useful 
tools for the identification and even determination of relative proportions of the different polypeptides or 
proteins in biological samples, particularly human samples containing LAV or related viruses. 

The invention further relates to the hosts (procaryotic or eucaryotic cells) which are transformed by the 
above mentioned recombinants and which are capable of expressing said DNA fragments. 

30 Finally the invention also concerns vectors for the transformation of eucaryotic cells of human origin, 
particularly lymphocytes, the polymerases of which are capable of recognizing the LTRs of LAV. Particularly 
said vectors are characterized by the presence of a LAV LTR therein, said LTR being then active as a 
promoter enabling the efficient transcription and translation in a suitable host of a DNA insert coding for a 
determined protein placed under its controls. 

35 Needless to say that the invention extends to all variants of genomes and corresponding DNA 
fragments (ORFs) having substantially equivalent properties, all of said genomes belonging to retroviruses 
which can be considered as equivalents of LAV. 

tt must be understood that the claims which follow are also intended to cover all equivalents of the 
products (glycoproteins, polypeptides, DNAs, etc.), whereby an equivalent is a product i.e. a polypeptide 

ao which may distinguish from a determined one defined in any of said claims, say through one or several 
aminoacids, while still having substantially the same immunological or immunogenic properties. A similar 
rule of equivalency shall apply to the DNAs, it being understood that the rule of equivalency will then be 
tied to the rule of equivalency pertaining to the polypeptides which they encode. 

It will also be understood that all the iitterature referred to hereinbefore or hereinafter, and all patent 

45 applications or patents not specifically identified herein but which form counterparts of those specifically 
designated herein must be considered as incorporated herein by reference. 
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Claims 

40 

1/ Purified peptide excizable from a purified envelope glycoprotein of the LAV virus having a molecular 
weight of the order of 110,000, particularly of 110,000-120,000 or from a polypeptide which has the same 
polypeptide backbone structure as said glycoprotein, which peptide has a size not exceeding 200 
aminoacids. 

45 21 Purified peptide according to claim 1, characterised in that it contains at least one neutralizing 

epitope, such an epitope comprising more particularly an amino acid sequence having the formula Asn-X- 

Ser or Asn-X-Thr, wherein X is any possible aminoacid. 

3/ Purified peptide according to claim 1 or 2 characterised in that ft is encoded by the nucleotide 

sequence which extend respectively between the following positions: 
so a/ nucleotides 6171 to 6276 corresponding to the following aminoacid sequence 

AsnAlaThrAsnThrAsnSerSerAsnThrAsnSerSerSerGlyGluMetMetMetGluLysGIyGlulleLysAsnCysSerPheAsnll- 

eSerThrSerlle 

or 

b/ nucleotides 6637 to 6387 corresponding to the following aminoacid sequence 
55 AsnAspThrThrSerTyrThrLeuThrSerCysAsnThrSerVallleThr 

cJ nucleotides 6466 to 6516 corresponding "to the following aminoacid sequence 
AsnAsnLysThrPheAsnGlyThrGlyProCysThrAsnValSerThrVal 
or 
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d/ nucleotides 6562 to 6696 corresponding to the following aminoacid sequence 
LeuAsnGlySerLeuAlaGluGluGluValVallleArgSerAlaAsnPheThrAspAsnAlaLy^ 
G lu I le AsnCy sThr Arg Pro Asn Asn AsnThr Arg Ly s , or 

e/ nucleotides 6936 to 7005 corresponding to the following aminoacid sequence 
5 AsnSerThrGlnLeuPheAsnSerThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnLeuThr, or 
if nucleotides 7612 to 7746 corresponding to the following aminoacid sequence 
AsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrpAsnAsnMetThrTrpMetGluTrpAspArgGlulleAsnAsnTyrThrSerLeu- 
IleHisSerLeulleGluGluSerGlnAsnGlnGlnGluLys 

4/ Purified peptide according to claim 1 . characterized in that it is selected from the following group, the 
w first and last aminoacids corresponding respectively to the N-terminal and C-terminal aminoacids, 
aminoacids 8-23 inclusive, i.e. Met-Arg-Val-Lys-Glu-Lys-Tyr-Gln-His-Leu-Trp-Arg-Trp-Gly-Trp-Lys- 
aminoacids 63-78 inclusive, i.e. Ser-Asp-Ala-Lys-Ala-Tyr-Asr>Thr-Glu-Val-His-Asn-Val-Trp-Ala-Thr- 
aminoacids 82-90 inclusive, i.e. Val-Pro-Thr-Asp-Pro-Asn-Pro-Gln-Glu- 

aminoacids 97-123 inclusive, i.e. Thr-Glu-Asn-Phe-Asn-Met-Trp-Lys-Asn-Asp-Met-Val-Glu-Gln-Met-His-Glu- 
75 Asp-lle-lle-Ser-Leu-Trp-Asp-Gln-Ser-Leu- 

aminoacids 127-183 inclusive, i.e. Val-Lys-Leu-Thr-Pro-Leu-Cys-Val-Ser-Leu-Lys-Cys-Th^ 

Ala-Thr-Asn-Thr-Asn-Ser-Ser-Asn-Thr-Asn-Ser-Ser^^ 

Cys-Ser-Phe-Asn-lle-Ser-Thr-Ser-lle-Arg-Gly-Lys-Val-Gln-Lys- 

aminoacids 197-201 inclusive, i.e. Leu-Asp-He-lle-Pro-lle-Asp-Asn-Asp-Thr-Thr- 
20 aminoacids 239-294 inclusive, i.e. Lys-Cys-Asn-Asn-Lys-Thr-Phe-Asn-Gly-T^ 

Thr-Val-Gln-Cys-Thr-His-Gly-lle^ 
Val-Val-lle-Arg-Ser-Ala-Asn-Phe- Thr-Asp-Asn-Ala-Lys- 

aminoacids 300-327 inclusive i.e. Leu-Asn-Gln-Ser-Val-Glu-lle-Asn-Cys-Thr-Arg-Pro-Asn-Asn-Asn-Thr-Arg- 
Lys-Ser-lle-Arg-lle-Gln-Arg-Gly-Pro-Gly-Arg- 
25 aminoacids 334-381 inclusive, i.e. Lys-lle-Gly-Asn-Met-Arg-Gln-Aia-His-Cys-Asn^ 
Asn-Ala-Thr-Leu-Lys-Gln-lle-Ala-Ser-Lys-Leu-Arg-Glu^ 
Ser-Gly-Gly-Asp-Pro- 

aminoacids 397-424 inclusive, i.e. Cys-Asn-Ser-Thr-Gln-Leu-Phe-Asn-Ser-Thr-Trp-Phe-Asn-Ser-Thr-Trp-Ser- 

Thr-Glu-Gly-Ser-Asn-Asn-TTir-Giu-Gly-Ser-Asp- 
30 aminoacids 466-500 inclusive, i.e. Leu-Thr-Arg-Asp-Gly-Gly-Asn-Asn-Asn-Asn-Gly-Ser-Glu-lle-Phe-Arg-Pro- 

Gly-Gly-Gly-Asp-Met-Arg-Asp-Asn-Trp-Arg-Ser-Glu-Leu-Tyr-Lys-Tyr-Lys-Val- 

aminoacids 510-523 inclusive, Le. Pro-Thr-Lys-Ala-Lys-Arg-Arg-Val-Val-Gln-Arg-Glu-Lys-Arg- 

aminoacids 551-577 inclusive, i.e. Val-Gln-Ala-Arg-Gln-Leu-Leu-Ser-Gly-He-Val-Gln-Gln-Gln-Asn-Asn-Leu- 

Leu-Arg-Ala-lle-Glu-Ala-Gln-Gln-His-Leu- 
35 aminoacids 594-603 inclusive, i.e. Ala-Val-Glu-Arg-Tyr-Leu-Lys-Asp-Gln-GIn- 

aminoacids 621-630 inclusive, i.e. Pro-Trp-Asn-Ala-Ser-Trp-Ser-Asn-Lys-Ser- 

aminocids 657-679 inclusive, i.e. Leu-lle-Glu-Glu-Ser-Gln-Asn-Gln-Gln-Glu-Lys-Asn-Glu-Gln-Glu-Leu-Leu- 
Glu-Leu-Asp-Lys-Trp-Aia- 

aminoacids 719-758 inclusive, i.e. Arg-Val-Arg-Gln-Gly-Tyr-Ser-Pro-Leu-Ser-Phe-Gln-Thr-His-Leu-Pro-Thr- 
40 Pro-Arg-Gly-Pro-Asp-Arg-Pro-Glu-Gly-Me-Glu-Glu-Glu-Gly-Gly-Glu-Arg-Asp-Arg-Asp-Arg-Ser-ll^ 

aminoacids 780-803 inclusive, i.e. Tyr-His-Arg-Leu-Arg-Asp-Leu-Leu-Leu-Ile-Val-Thr-Arg- lle-Val-GIu-Leu- 

Leu-Gly-Arg-Arg-Gly-Trp-Glu- 

5/ Purified peptide according to claim 2, characterised in that it is selected from the following group, the 

first and last aminoacids corresponding respectively to the N-terminal and C-terminal aminoacids, 
45 aminoacids 127-183 inclusive, i.e. Val-Lys-Leu-Thr-Pro-Leu-Cys-Val-Ser-Leu^ 

Ala-Thr-Asn-Thr-Asn-Ser-Ser-Asn^ 

Cys-Ser-Phe-Asn-lle-Ser-Thr-Ser-lle-Arg-Gly-Lys-Val-Gln-Lys- 

aminoacids 197-201 inclusive, i.e. Leu- Asp- He- He-Pro- He- Asp- Asn- Asp-Thr-Thr- 

aminoacids 239-294 inclusive, i.e. Lys-Cys-Asn-Asn-Lys-Thr-Phe-Asn-Gly-Thr-Gly-Pro-Cys-Thr-Asn-Val-Ser- 
50 Thr-Val-Gln-Cys-Thr-His-Gly-lle-Ar 

Val-Val-lle-Arg-Ser-Ala-Asn-Phe-Thr-Asp-Asn-Ala-Lys- 

aminoacids 300-327 inclusive i.e. Leu-Asn-Gln-Ser-Val-Glu-lle-Asn-Cys-Thr-Arg-Pro-Asn-Asn-Asn-Thr-Arg- 
Lys-Ser-lle-Arg-lle-Gln-Arg-Gly-Pro-Gly-Arg- 

aminoacids 334-381 inclusive, i.e. Lys-He-Gly-Asn-Met-Arg-Gln-Ala-His-Cys-Asn-lle-Ser-Arg-Ala-Lys-Trp- 
55 Asn-Ala-Thr-Leu-Lys-Gln-lle-Ala-Ser-Lys-Leu-Arg^^^ 
Ser-Gly-Gly-Asp-Pro- 

aminoacids 397-424 inclusive, i.e. Cys-Asn-Ser-Thr-Gln-Leu-Phe-Asn-Ser-Thr-Trp-Phe-Asn-Ser-Thr-Trp-Ser- 
Thr-Glu-Gly-Ser-Asn-Asn-Thr-Glu-Gly-Ser-Asp- 
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aminoacids 466-500 inclusive, i.e. Leu-Thr-Arg-Asp-Gly-Gly-Asn-Asn-Asn-Asn-Gly-Ser-Glu-lle-Phe-Arg-Pro- 

Gly-Gly-Gly-Asp-Met-Arg-Asp-Asn-Trp-Arg-Ser-Glu*Leu-Tyr-Lys-Tyr-Lys-Val- 

aminoacids 621-630 inclusive, i.e. Pro-Trp-Asn-Ala-Ser-Trp-Ser-Asn-Lys-Ser- 

6/ Purified peptide according to anyone of claims 2 or 5, characterized in that it is an epitope-bearing 
5 peptide and particularly in that it is an immunogenic peptide. 

71 Purified peptide according to claim 6, under an oligomer form, especially a water soluble im- 
munogenic oligomer, 

8/ Hybrid polypeptide comprising a peptide according to anyone of claims 2, 5 or 6 and having N- 
terminal and C-terminal extremities, covalently bond to aminoacids other than those which are normally 
.70 associated with them in the gp110 of LAV, which last mentioned aminoacids are then free or belong to 
another polypeptidic sequence, particularly a sequence having a size sufficient to provide for an increased 
immunogenicity 

9/ Composition of peptides characterized in that it comprises at least 2 peptides according to anyone of 
claims 1 to 6. 

T5 10/ Immunogenic composition comprising a purified peptide according to claim 1, 5 or 6, or a purified 
oligomer according to claim 7, or a hybrid polypeptide according to claim 8, and a suitable pharmaceutical 
vehicle. 

11/ DNA sequence characterized in that it codes for a peptide according to anyone of claims 1-6. 
12/ DNA sequence according to claim 11 characterized in that it is inserted in a vector. 
20 13/ Process for the chemical synthesis of a peptide or polypeptide according to anyone of claims 1 to 
6, characterized by: 

- successively condensing either the successive aminoacids in twos, in the appropriate order or successive 
peptide fragments previously available or formed and containing already several aminoacyl residues in the 
appropriate order respectively, all the reactive groups other than the carboxyl and aminogroups which will 

25 be engaged in the formation of the peptide bond being protected, 

- possibly activating the carboxylgroups prior to the formation of the peptide bond, 

- recovering the peptide or polypeptide formed. 

14/ Process for the preparation of a peptide according to anyone of claims 1 to 6, characterised by: 

- the transformation of procaryotic or eucaryotic host with a suitable vector previously modified by the 
30 corresponding DNA according to claim 12 in conditions enabling the production of the desired peptide, 

- recovering and purifying the peptide thus obtained for instance by electrophoresis. 

15/ Process for the in vitro diagnosis of the presence of anti-LAV antibodies in biological media 
characterized in that it comprises contacting said biological fluid with the product of any of claims 1 to 5 
under conditions suitable for enabling a complex between said antibodies and said product to be formed 
35 and detecting said complex as indicative of the presence of said antibodies in said biological fluid. 

16/ Kit for the in vitro diagnosis of the presence of antibodies directed against LAV virus in fluids which 
kit comprises: 

- a peptide according to anyone of anyone claims 1 to 5 or a composition according to claim, 

- biological and chemical reagents necessary for performing the diagnostic assay such as suitable buffers, 
40 anti-human immunoglobulins previously by labelled with an immunofluorescent molecule or an enzyme, 

- when the anti-human immunoglobulins are labelled with an enzyme, a substrate hydrolysable by the 
enzyme and providing a signal indicative of the presence of antibody in the biological fluid. 
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